Unit 5: Data Matching and Consolidating 





e Combination matching 


Rule-Based Matching 


With rule-based matching, you rely solely on your match and no match scores to determine 
matches within a criterion. 


The following example shows how to set up this method in the Match transform: 


Table 35: Rule-Based Matching 


Criteria RecordA Record B No-match Match Score | Similarity 
Score Score 


eS 


LastName Name [Smith Smitt 


jester -o mary. srt — 
rdr.com 


By entering a value of 101 in the match score for every criterion except the last, the First 
Name and Last Name criteria never determine a match, because two fields cannot be more 
than 100 percent alike. 





By setting the match and no match score for the E-mail criteria with a one point difference, 
any comparison that reaches the last criterion must either be a match or ano match. 


Weighted-scoring matching 


In a weighted-scoring matching, you can assign different weights to individual criterion by 
specifying a contribution value. The higher the value, the more weight that criterion carries in 
determining matches. Fields that are more likely to determine a match should be assigned 
more weight. For example, an SSN or account number may be assigned a higher weighted 
value than an E-mail address. The total of all contribution values must total 100. 


The Match transform generates the contribution score for each criterion by multiplying the 
contribution value with the similarity score. These individual contribution scores are then 
added to get the total contribution score. Matches are determined by comparing the total 
contribution score with the weighted match score. If the total contribution score is equal to or 
greater than the weighted match score, the records are considered a match. If the total 
weighted score is less than the weighted match score, the records are considered a no match. 


You can set the weighted match score in the Weighted Match Score option of the Match Level 
Editor. When you set up weighted scoring, the no match score must be set to —1, and the 
match score must be set to 101. These values ensure that neither a match or ano match is 
determined for the specified criterion. In this example, the contribution value for the Email 
criterion gives it the most importance. 


Table 36: Weighted-Scoring Matching 


Criteria Record A | Record B | No-match| Match Similarity | Contr. Contr. 
Score Score Score Value* Score 


First Mary Mary -1 101 100 25 25 
Name 
Last Smith Smitt -1 101 25 20 
Name 
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